Skip to content

Conversation

@daiyaanarfeen
Copy link

Overview:

Changes Planner so it collects backend metrics instead of frontend metrics to decide whether to scale

Details:

Planner prometheus client searches for backend metrics when using vLLM, otherwise defaults to frontend metrics

Where should the reviewer start?

components/src/dynamo/planner/utils/planner_core.py
components/src/dynamo/planner/utils/prometheus.py

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

  • closes GitHub issue: #xxx

@copy-pr-bot
Copy link

copy-pr-bot bot commented Nov 5, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@daiyaanarfeen daiyaanarfeen changed the title vllm metrics pulled from backend feat: vllm metrics pulled from backend Nov 5, 2025
@github-actions github-actions bot added the feat label Nov 5, 2025
@daiyaanarfeen daiyaanarfeen force-pushed the darfeen/migrate-planner-metrics branch from 8240a72 to 3ae056d Compare November 6, 2025 17:51
Signed-off-by: Daiyaan <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants